A Multilingual Natural-Language Interface To Regular Expressions

Author

  • Aarne Ranta

Abstract

This report explains a natural-language interface to the formalism of XFST (Xerox Finite State Tool), which is a rich language used for specifying finite state automata and transducers. By using the interface, it is possible to give input to XFST in English and French, as well as to translate formal XFST code into these languages. It is also possible to edit XFST source files and their natural-language equivalents interactively, in parallel. The interface is based on an abstract syntax of the regular expression language and of a corresponding fragment of natural language. The relations between the different components are defined by compositional interpretation and generation functions, and by corresponding combinatory parsers. This design has been inspired by the logical grammar of Montague. The grammar-driven design makes it easy to extend and to modify the interface, and also to link it with other functionalities such as compiling and semantic reasoning. It is also easy to add new languages to the interface. Both the grammatical theory and the interface facilities based on it have been implemented in the functional programming language Haskell, which supports a declarative and modular style of programming. Some of the modules developed for the interface have other uses as well: there is a type system of regular expressions, preventing some compiler errors, a denotational semantics in terms of lazy lists, and an extension of the XFST script language by definitions of functions.
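
The design described in the abstract can be illustrated with a minimal sketch. It is not the report's code: the report's implementation is in Haskell, while Python is used here only for illustration. The constructor names (`Sym`, `Concat`, `Alt`, `Star`), the function names, and the English wording are all hypothetical. The sketch shows one abstract syntax tree rendered compositionally into a fully bracketed XFST-style notation and into English, plus a lazily enumerated denotation in which a generator stands in for a lazy list.

```python
from dataclasses import dataclass
from itertools import count, islice
from typing import Iterator, Set, Union


# Abstract syntax: a tiny, hypothetical fragment of regular expressions.
@dataclass(frozen=True)
class Sym:
    char: str


@dataclass(frozen=True)
class Concat:
    left: "Regex"
    right: "Regex"


@dataclass(frozen=True)
class Alt:
    left: "Regex"
    right: "Regex"


@dataclass(frozen=True)
class Star:
    body: "Regex"


Regex = Union[Sym, Concat, Alt, Star]


def to_xfst(e: Regex) -> str:
    """Compositional rendering into a fully bracketed XFST-style notation."""
    if isinstance(e, Sym):
        return e.char
    if isinstance(e, Concat):
        return f"[{to_xfst(e.left)} {to_xfst(e.right)}]"
    if isinstance(e, Alt):
        return f"[{to_xfst(e.left)} | {to_xfst(e.right)}]"
    return f"{to_xfst(e.body)}*"  # Star


def to_english(e: Regex) -> str:
    """Compositional rendering into English (assumed wording)."""
    if isinstance(e, Sym):
        return f"the symbol {e.char}"
    if isinstance(e, Concat):
        return f"{to_english(e.left)} followed by {to_english(e.right)}"
    if isinstance(e, Alt):
        return f"either {to_english(e.left)} or {to_english(e.right)}"
    return f"any number of repetitions of {to_english(e.body)}"  # Star


def words(e: Regex, n: int) -> Set[str]:
    """All strings of length exactly n denoted by e."""
    if isinstance(e, Sym):
        return {e.char} if len(e.char) == n else set()
    if isinstance(e, Alt):
        return words(e.left, n) | words(e.right, n)
    if isinstance(e, Concat):
        return {u + v
                for k in range(n + 1)
                for u in words(e.left, k)
                for v in words(e.right, n - k)}
    # Star: the empty string, or a non-empty word of the body and then more.
    if n == 0:
        return {""}
    return {u + v
            for k in range(1, n + 1)
            for u in words(e.body, k)
            for v in words(e, n - k)}


def language(e: Regex) -> Iterator[str]:
    """Lazily enumerate the denotation, shortest strings first.

    For a finite language the generator never terminates on its own,
    so callers should take a bounded prefix.
    """
    for n in count():
        yield from sorted(words(e, n))


example = Concat(Sym("a"), Star(Alt(Sym("b"), Sym("c"))))
print(to_xfst(example))
print(to_english(example))
print(list(islice(language(example), 4)))
```

Because both renderings are defined case by case on the same tree, supporting a further language in this sketch would mean adding one more function of the same shape, which is a simplified picture of the extensibility the abstract claims.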


Similar resources

A Corpus and Semantic Parser for Multilingual Natural Language Querying of OpenStreetMap

We present a corpus of 2,380 natural language queries paired with machine readable formulae that can be executed against world wide geographic data of the OpenStreetMap (OSM) database. We use the corpus to learn an accurate semantic parser that builds the basis of a natural language interface to OSM. Furthermore, we use response-based learning on parser feedback to adapt a statistical machine t...

Initial Experiments with Multilingual Extraction of Rhetoric Figures by means of PERL-compatible Regular Expressions

A language-independent method of figure-of-speech extraction is proposed in order to reinforce rhetoric-oriented considerations in natural language processing studies. The method is based upon a translation of a canonical form of repetition-based figures of speech into the language of PERL-compatible regular expressions. Anadiplosis, anaphora, and antimetabole figures were translated into the form e...

Natural Language Modeling in a Machine Translation Prototype for Healthcare Applications: a Sublanguage Approach

This paper discusses methodological issues related to natural language modeling in the framework of the LRE project ANTHEM 1. The objective of ANTHEM is to develop a portable prototype of a multilingual natural language interface that allows users of Healthcare Information Systems to enter diagnostic expressions using their own natural language, and to have this input translated in whatever for...

Towards Development of Multilingual Spoken Dialogue Systems

Developing multilingual dialogue systems brings up various challenges. Among them is the development of natural language understanding and generation components, with a focus on creating new language parts as rapidly as possible. Another challenge is to ensure compatibility between the different language-specific components during maintenance and ongoing development of the system. We describe our expe...

Extending Tableaux Calculus with Limited Regular Expression for Role Path: an Application to Natural Language Processing

The main challenge in a natural language interface for databases is to provide easy portability and fast customization for a new database. In this focus, we try to design a simple syntactic analyser that could be plugged to a database model with minimum efforts. We use consistency test and classification capabilities of description logics to solve ambiguities and semantic shortcuts. The introduct...

Publication date: 1998